skip to main content


Search for: All records

Creators/Authors contains: "Ellington, Andrew D."

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    Deep learning models are seeing increased use as methods to predict mutational effects or allowed mutations in proteins. The models commonly used for these purposes include large language models (LLMs) and 3D Convolutional Neural Networks (CNNs). These two model types have very different architectures and are commonly trained on different representations of proteins. LLMs make use of the transformer architecture and are trained purely on protein sequences whereas 3D CNNs are trained on voxelized representations of local protein structure. While comparable overall prediction accuracies have been reported for both types of models, it is not known to what extent these models make comparable specific predictions and/or generalize protein biochemistry in similar ways. Here, we perform a systematic comparison of two LLMs and two structure-based models (CNNs) and show that the different model types have distinct strengths and weaknesses. The overall prediction accuracies are largely uncorrelated between the sequence- and structure-based models. Overall, the two structure-based models are better at predicting buried aliphatic and hydrophobic residues whereas the two LLMs are better at predicting solvent-exposed polar and charged amino acids. Finally, we find that a combined model that takes the individual model predictions as input can leverage these individual model strengths and results in significantly improved overall prediction accuracy.

     
    more » « less
  2. Abstract

    A growing interest in aptamer research, as evidenced by the increase in aptamer publications over the years, has led to calls for a go-to site for aptamer information. A comprehensive, publicly available aptamer dataset, which may be a repository for aptamer data, standardize aptamer reporting, and generate opportunities to expand current research in the field, could meet such a demand. There have been several attempts to create aptamer databases; however, most have been abandoned or removed entirely from public view. Inspired by previous efforts, we have published the UTexas Aptamer Database, https://sites.utexas.edu/aptamerdatabase, which includes a publicly available aptamer dataset and a searchable database containing a subset of all aptamer data collected to date (1990–2022). The dataset contains aptamer sequences, binding and selection information. The information is regularly reviewed internally to ensure accuracy and consistency across all entries. To support the continued curation and review of aptamer sequence information, we have implemented sustaining mechanisms, including researcher training protocols, an aptamer submission form, data stored separately from the database platform, and a growing team of researchers committed to updating the database. Currently, the UTexas Aptamer Database is the largest in terms of the number of aptamer sequences with 1,443 internally reviewed aptamer records.

     
    more » « less
  3. We demonstrate in vitro incorporation of cyclic β-amino acids into peptides by the ribosome through genetic code reprogramming. Further, we show that incorporation efficiency can be increased through the addition of elongation factor P. 
    more » « less
  4. null (Ed.)
    In the search for life beyond Earth, distinguishing the living from the non-living is paramount. However, this distinction is often elusive, as the origin of life is likely a stepwise evolutionary process, not a singular event. Regardless of the favored origin of life model, an inherent “grayness” blurs the theorized threshold defining life. Here, we explore the ambiguities between the biotic and the abiotic at the origin of life. The role of grayness extends into later transitions as well. By recognizing the limitations posed by grayness, life detection researchers will be better able to develop methods sensitive to prebiotic chemical systems and life with alternative biochemistries. 
    more » « less
  5. Artificially designed molecular systems with programmable behaviors have become a valuable tool in chemistry, biology, material science, and medicine. Although information processing in biological regulatory pathways is remarkably robust to error, it remains a challenge to design molecular systems that are similarly robust. With functionality determined entirely by secondary structure of DNA, strand displacement has emerged as a uniquely versatile building block for cell-free biochemical networks. Here, we experimentally investigate a design principle to reduce undesired triggering in the absence of input (leak), a side reaction that critically reduces sensitivity and disrupts the behavior of strand displacement cascades. Inspired by error correction methods exploiting redundancy in electrical engineering, we ensure a higher-energy penalty to leak via logical redundancy. Our design strategy is, in principle, capable of reducing leak to arbitrarily low levels, and we experimentally test two levels of leak reduction for a core “translator” component that converts a signal of one sequence into that of another. We show that the leak was not measurable in the high-redundancy scheme, even for concentrations that are up to 100 times larger than typical. Beyond a single translator, we constructed a fast and low-leak translator cascade of nine strand displacement steps and a logic OR gate circuit consisting of 10 translators, showing that our design principle can be used to effectively reduce leak in more complex chemical systems. 
    more » « less
  6. Rasmussen, Angela L. (Ed.)
    ABSTRACT Isothermal nucleic acid amplification tests (iNATs), such as loop-mediated isothermal amplification (LAMP), are good alternatives to PCR-based amplification assays, especially for point-of-care and low-resource use, in part because they can be carried out with relatively simple instrumentation. However, iNATs can often generate spurious amplicons, especially in the absence of target sequences, resulting in false-positive results. This is especially true if signals are based on non-sequence-specific probes, such as intercalating dyes or pH changes. In addition, pathogens often prove to be moving, evolving targets and can accumulate mutations that will lead to inefficient primer binding and thus false-negative results. Multiplex assays targeting different regions of the analyte and logical signal readout using sequence-specific probes can help to reduce both false negatives and false positives. Here, we describe rapid conversion of three previously described SARS-CoV-2 LAMP assays that relied on a non-sequence-specific readout into individual and multiplex one-pot assays that can be visually read using sequence-specific oligonucleotide strand exchange (OSD) probes. We describe both fluorescence-based and Boolean logic-gated colorimetric lateral flow readout methods and demonstrate detection of SARS-CoV-2 virions in crude human saliva. IMPORTANCE One of the key approaches to treatment and control of infectious diseases, such as COVID-19, is accurate and rapid diagnostics that is widely deployable in a timely and scalable manner. To achieve this, it is essential to go beyond the traditional gold standard of quantitative PCR (qPCR) that is often faced with difficulties in scaling due to the complexity of infrastructure and human resource requirements. Isothermal nucleic acid amplification methods, such as loop-mediated isothermal amplification (LAMP), have been long pursued as ideal, low-tech alternatives for rapid, portable testing. However, isothermal approaches often suffer from false signals due to employment of nonspecific readout methods. We describe general principles for rapidly converting nonspecifically read LAMP assays into assays that are read in a sequence-specific manner by using oligonucleotide strand displacement (OSD) probes. We also demonstrate that inclusion of OSD probes in LAMP assays maintains the simplicity of one-pot assays and a visual yes/no readout by using fluorescence or colorimetric lateral-flow dipsticks while providing accurate sequence-specific readout and the ability to logically query multiplex amplicons for redundancy or copresence. These principles not only yielded high-surety isothermal assays for SARS-CoV-2 but might also aid in the design of more sophisticated molecular assays for other analytes. 
    more » « less